Various MTSAC bug fixes #1975

avnishn · 2020-08-27T17:15:12Z

Fixes the examples to use the correct num_tasks.

Fixes max_episode_length_eval not being used by the algorithm.

Co-authored-by: Tianhong Dai <tianhongdai914@gmail.com>

avnishn · 2020-08-27T17:17:32Z

TianhongDai · 2020-08-27T17:22:01Z

@TianhongDai

@avnishn Thanks Avnishn! I think I forgot to read the contribution guidelines when I submitted the PR. Sorry for causing you so much trouble.

codecov · 2020-08-27T18:02:40Z

Codecov Report

Merging #1975 into master will decrease coverage by 0.10%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #1975      +/-   ##
==========================================
- Coverage   93.50%   93.40%   -0.11%     
==========================================
  Files         192      192              
  Lines       10182    10184       +2     
  Branches     1268     1269       +1     
==========================================
- Hits         9521     9512       -9     
- Misses        438      446       +8     
- Partials      223      226       +3

Impacted Files	Coverage Δ
src/garage/torch/algos/sac.py	`98.23% <ø> (ø)`
src/garage/torch/algos/mtsac.py	`93.33% <100.00%> (+0.31%)`	⬆️
src/garage/plotter/plotter.py	`59.77% <0.00%> (-3.45%)`	⬇️
...rage/tf/optimizers/conjugate_gradient_optimizer.py	`83.16% <0.00%> (-2.05%)`	⬇️
src/garage/misc/tensor_utils.py	`78.94% <0.00%> (-1.76%)`	⬇️
src/garage/sampler/multiprocessing_sampler.py	`89.26% <0.00%> (-1.35%)`	⬇️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update a2fb966...c83d34e.

ryanjulian · 2020-08-27T20:23:24Z

examples/torch/mtsac_metaworld_ml1_pick_place.py

+        env_spec=ml1_train_envs.spec,
+        num_tasks=50,
+        steps_per_epoch=epoch_cycles,
+        replay_buffer=replay_buffer,


Please pass positional arguments positionally and keyword arguments by keyword only.
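One way to read this review comment is sketched below. The function `build_algo` and its parameters are hypothetical and are not garage's MTSAC signature; the sketch only illustrates how Python (3.8+) can enforce the convention with positional-only (`/`) and keyword-only (`*`) markers.

```python
def build_algo(policy, qf, /, *, num_tasks, steps_per_epoch=1):
    """Hypothetical constructor: policy and qf are positional-only,
    num_tasks and steps_per_epoch are keyword-only."""
    return {
        'policy': policy,
        'qf': qf,
        'num_tasks': num_tasks,
        'steps_per_epoch': steps_per_epoch,
    }


# Positional arguments are passed positionally, keyword arguments by name.
algo = build_algo('policy', 'qf', num_tasks=50, steps_per_epoch=10)

# Passing a keyword-only argument positionally is rejected by Python itself.
try:
    build_algo('policy', 'qf', 50)
except TypeError:
    positional_call_rejected = True
else:
    positional_call_rejected = False
```

With a signature like this, a call site that mixes the two styles fails immediately instead of silently binding a value to the wrong parameter.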

ryanjulian · 2020-08-27T20:24:40Z

tests/garage/torch/algos/test_mtsac.py

+                    'The correct number of tasks?')
+    obs = torch.Tensor([env.reset()[0]] * buffer_batch_size)
+    with pytest.raises(ValueError, match=error_string):
+        mtsac._get_log_alpha(dict(observation=obs))


no need to test a private method?

This is true; however, the tests are in place to verify the correctness of this implementation, and I feel more comfortable having them than not. At the same time, it makes no sense to expose this as a public field, because it has no use outside of the algorithm.
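The kind of check discussed in this thread can be sketched without garage or torch. Everything below is a hypothetical stand-in: `TinyMultiTaskAlpha`, its plain-list data, and the helper `raises_value_error` are assumptions for illustration, not the PR's implementation. In a pytest suite the helper would typically be replaced by `pytest.raises(ValueError, match=...)`, as in the diff above.

```python
import re


class TinyMultiTaskAlpha:
    """Hypothetical stand-in for an algorithm with one log-alpha per task."""

    def __init__(self, log_alpha):
        self._log_alpha = list(log_alpha)
        self._num_tasks = len(self._log_alpha)

    def _get_log_alpha(self, one_hots):
        """Pick each sample's log-alpha from its one-hot task id."""
        for one_hot in one_hots:
            if len(one_hot) != self._num_tasks:
                raise ValueError(
                    'The number of tasks in the environment does not '
                    'match num_tasks.')
        return [
            sum(alpha * flag for alpha, flag in zip(self._log_alpha, one_hot))
            for one_hot in one_hots
        ]


def raises_value_error(func, pattern):
    """Return True if func() raises a ValueError whose message matches pattern."""
    try:
        func()
    except ValueError as err:
        return re.search(pattern, str(err)) is not None
    return False


algo = TinyMultiTaskAlpha(log_alpha=[0.0, 0.5])

# Matching task count: each sample selects its own task's value.
selected = algo._get_log_alpha([[0, 1], [1, 0]])

# Mismatched task count: a three-task one-hot against a two-task algorithm.
mismatch_detected = raises_value_error(
    lambda: algo._get_log_alpha([[0, 0, 1]]), 'number of tasks')
```

Testing the private method directly keeps the check out of the public API while still guarding against a misconfigured task count.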

ryanjulian · 2020-08-27T20:25:00Z

Please reference the issues you're fixing

maliesa96 · 2020-08-28T01:35:15Z

examples/torch/mtsac_metaworld_mt10.py

+    runner.setup(algo=mtsac,
+                 env=mt10_train_envs,
+                 sampler_cls=LocalSampler,
+                 n_workers=1)


Why limit the number of workers here, and can't we use ray?

avnishn · 2020-08-28T03:55:35Z

@maliesa96 We can't. TL;DR: using the Ray sampler with the old Metaworld envs uses more memory than we have on the lab machines.

maliesa96 · 2020-08-28T06:18:30Z

Damn alright, LGTM then.

Backport #1905, #1975, #1908 to fix max_eval_path_length not being used by mtsac and sac, and to add a check for an incorrect num_tasks being set in mtsac.

Timelimit.truncated modified only when necessary. This issue occurs when multiple garage envs are nested, or when timelimit truncated = False is already included in the environment keys. Previously, our timelimit truncated logic assumed that the key was added only when a time limit truncation occurred. If an environment already had timelimit truncated = False in its keys, the previous behavior was to set Done = True, which is incorrect. That caused performance degradation in MTSAC and MTPPO/TRPO. Now Done is only true in the normal/trivial case, never because timelimit truncated is False.
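The behavior change described in this commit message can be sketched as follows. This is a minimal illustration, not garage's code: the function names are hypothetical, and the info key 'TimeLimit.truncated' is assumed to be the one written by Gym's TimeLimit wrapper.

```python
def resolve_done_old(done, info):
    """Sketch of the old behavior: treats the mere presence of the key
    as evidence of a truncation, so a False value forces done to True."""
    if 'TimeLimit.truncated' in info:
        return not info['TimeLimit.truncated']
    return done


def resolve_done_new(done, info):
    """Sketch of the fixed behavior: only change done when a truncation
    actually occurred; otherwise pass the environment's value through."""
    if info.get('TimeLimit.truncated', False):
        # The episode ended only because the time limit expired,
        # so it is not a true environment termination.
        return False
    return done


# A nested env that already reports truncated = False on a normal step.
mid_episode_info = {'TimeLimit.truncated': False}
old_result = resolve_done_old(False, mid_episode_info)
new_result = resolve_done_new(False, mid_episode_info)
```

In this sketch the old logic reports the episode as done on an ordinary mid-episode step, while the new logic leaves the flag untouched.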

avnishn requested a review from a team as a code owner August 27, 2020 17:15

avnishn requested review from ahtsan and removed request for a team August 27, 2020 17:15

mergify bot requested review from a team, AiRuiChen and nicolengsy and removed request for a team August 27, 2020 17:15

avnishn mentioned this pull request Aug 27, 2020

fix eval max episodes length bug in mtsac #1952

Closed

ryanjulian reviewed Aug 27, 2020

ryanjulian approved these changes Aug 27, 2020

avnishn linked an issue Aug 27, 2020 that may be closed by this pull request

mtsac_metaworld_mt50.py sets num_tasks=10 #1948

Closed

avnishn added the ready-to-merge label Aug 27, 2020

avnishn requested review from krzentner and haydenshively August 27, 2020 20:31

maliesa96 reviewed Aug 28, 2020

mergify bot requested a review from a team August 28, 2020 01:35

maliesa96 approved these changes Aug 28, 2020

mergify bot requested a review from a team August 28, 2020 06:19

Various MTSAC bug fixes

c83d34e

ahtsan force-pushed the Avnish-mtsac-bug-fixes branch from e191183 to c83d34e Compare August 28, 2020 06:19

maliesa96 approved these changes Aug 28, 2020

mergify bot requested a review from a team August 28, 2020 06:26

mergify bot merged commit fc3ddc6 into master Aug 28, 2020

mergify bot deleted the Avnish-mtsac-bug-fixes branch August 28, 2020 07:26

avnishn pushed a commit that referenced this pull request Sep 3, 2020

Backport Various SAC and MTSAC Bug Fixes

233b035

avnishn mentioned this pull request Sep 3, 2020

Backport #1905, #1975, #1908 #2002

Closed
